DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...690
Hits 1 – 20 of 13.783

1
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
BASE
Show details
2
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
BASE
Show details
3
One model for the learning of language.
In: Proceedings of the National Academy of Sciences of the United States of America, vol 119, iss 5 (2022)
BASE
Show details
4
Thirty Years of Machine Translation in Language Teaching and Learning: A Review of the Literature
In: L2 Journal, vol 14, iss 1 (2022)
BASE
Show details
5
Assessing the impact of OCR noise on multilingual event detection over digitised documents
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
Abstract: International audience ; Event detection (ED) is a crucial task for natural language processing (NLP) and it involves the identification of instances of specified types of events in text and their classification into event types. The detection of events from digitised documents could enable historians to gather and combine a large amount of information into an integrated whole, a panoramic interpretation of the past. However, the level of degradation of digitised documents and the quality of the optical character recognition (OCR) tools might hinder the performance of an event detection system. While several studies have been performed in detecting events from historical documents, the transcribed documents needed to be hand-validated which implied a great effort of human expertise and manual labor-intensive work. Thus, in this study, we explore the robustness of two different event detection language-independent models to OCR noise, over two datasets that cover different event types and multiple languages. We aim at analysing their ability to mitigate problems caused by the low quality of the digitised documents and we simulate the existence of transcribed data, synthesised from clean annotated text, by injecting synthetic noise. For creating the noisy synthetic data, we chose to utilise four main types of noise that commonly occur after the digitisation process: Character Degradation, Bleed Through, Blur, and Phantom Character. Finally, we conclude that the imbalance of the datasets, the richness of the different annotation styles, and the language characteristics are the most important factors that can influence event detection in digitised documents.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; Digitised Documents; Event Detection; Information Extraction
URL: https://hal.archives-ouvertes.fr/hal-03635985/file/IJDL2022-Assessing%20the%20Impact%20of%20OCR%20Noise%20on%20Multilingual%20Event%20Detection%20over%20Digitised%20Documents.pdf
https://doi.org/10.1007/s00799-022-00325-2
https://hal.archives-ouvertes.fr/hal-03635985/document
https://hal.archives-ouvertes.fr/hal-03635985
BASE
Hide details
6
MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
BASE
Show details
7
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
BASE
Show details
8
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
In: Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021) ; https://hal.inria.fr/hal-03527328 ; Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021), Jan 2022, punta cana, Dominican Republic ; https://aclanthology.org/2021.wnut-1.47/ (2022)
BASE
Show details
9
Cross-lingual few-shot hate speech and offensive language detection using meta learning
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-03559484 ; IEEE Access, IEEE, 2022, 10, pp.14880-14896. ⟨10.1109/ACCESS.2022.3147588⟩ (2022)
BASE
Show details
10
Annotation of Morphological Errors in L2 Russian Corpus Analysis
In: 21st Annual Second Language Acquisition and Teaching Interdisciplinary Roundtable ; https://hal.archives-ouvertes.fr/hal-03620469 ; 21st Annual Second Language Acquisition and Teaching Interdisciplinary Roundtable, University of Arizona, Feb 2022, Tucson, United States (2022)
BASE
Show details
11
Cross-Situational Learning Towards Robot Grounding
In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
BASE
Show details
12
Cross-Situational Learning Towards Robot Grounding
In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
BASE
Show details
13
A Methodology for the Comparison of Human Judgments With Metrics for Coreference Resolution
In: HumEval at ACL ; https://hal.archives-ouvertes.fr/hal-03650294 ; HumEval at ACL, May 2022, Dublin, Ireland ; https://humeval.github.io/ (2022)
BASE
Show details
14
Le modèle Transformer: un « couteau suisse » pour le traitement automatique des langues
In: Techniques de l'Ingenieur ; https://hal.archives-ouvertes.fr/hal-03619077 ; Techniques de l'Ingenieur, Techniques de l'ingénieur, 2022, ⟨10.51257/a-v1-in195⟩ ; https://www.techniques-ingenieur.fr/base-documentaire/innovation-th10/innovations-en-electronique-et-tic-42257210/transformer-des-reseaux-de-neurones-pour-le-traitement-automatique-des-langues-in195/ (2022)
BASE
Show details
15
The use of MT by undergraduate translation students for different learning tasks
In: https://hal.archives-ouvertes.fr/hal-03547415 ; 2022 (2022)
BASE
Show details
16
Formulaic Expressions for Foreign Language Learning and Teaching
In: ISSN: 1615-3014 ; Linguistik Online ; https://hal.archives-ouvertes.fr/hal-03562566 ; Linguistik Online, Bern Open Publishing, 2022, Vermischtes/Miscellaneous, 113 (1), pp.91-110 ; https://bop.unibe.ch/linguistik-online (2022)
BASE
Show details
17
КОНТРОЛЬ КАК ОСНОВА ЭФФЕКТИВНОГО ОБУЧЕНИЯ ИНОСТРАННОМУ ЯЗЫКУ СТУДЕНТОВ НЕЯЗЫКОВЫХ ВУЗОВ ... : CONTROL AS A BASIS FOR EFFECTIVE FOREIGN LANGUAGE TEACHING OF STUDENTS IN NON-LINGUISTIC UNIVERSITIES ...
И.Ф. Мусаелян. - : Мир науки, культуры, образования, 2022
BASE
Show details
18
МОНОЛОГИЧЕСКАЯ РЕЧЬ С ТОЧКИ ЗРЕНИЯ УЧЁНЫХ ... : MONOLOGICAL SPEECH FROM THE POINT OF VIEW OF SCIENTISTS ...
Н. И. Шадманова. - : Academic research in educational sciences, 2022
BASE
Show details
19
АКТУАЛЬНЫЕ ТЕНДЕНЦИИ ЦИФРОВИЗАЦИИ ИНОЯЗЫЧНОГО ОБУЧЕНИЯ В НЕЯЗЫКОВОМ ВУЗЕ ... : CURRENT TRENDS IN DIGITALIZATION OF FOREIGN LANGUAGE EDUCATION IN A NON-LINGUISTIC UNIVERSITY ...
Е.Б. Манахова. - : Мир науки, культуры, образования, 2022
BASE
Show details
20
THE ROLE OF LISTENING IN LANGUAGE ACQUISITION; THE CHALLENGES & STRATEGIES IN TEACHING LISTENING ... : РОЛЬ СЛУШАНИЯ В ОФОРМЛЕНИИ ЯЗЫКА; ПРОБЛЕМЫ И СТРАТЕГИИ ОБУЧЕНИЯ АУДИРОВАНИЮ ...
Zokirova, Zulkhumor. - : Oriental renaissance: Innovative, educational, natural and social sciences, 2022
BASE
Show details

Page: 1 2 3 4 5...690

Catalogues
517
4
412
0
2
0
22
Bibliographies
2.117
0
0
0
0
0
0
5
50
Linked Open Data catalogues
0
Online resources
73
17
0
0
Open access documents
11.476
5
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern